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(~| . Abstract 



This paper deals with the local asymptotic structure, in the sense of 
Le Cam's asymptotic theory of statistical experiments, of the signal detec- 
tion problem in high dimension. More precisely, we consider the problem 



> 

■ of testing the null hypothesis of sphericity of a high-dimensional covariance 

d 



matrix against an alternative of (unspecified) multiple symmetry-breaking 
I directions {multispiked alternatives). Simple analytical expressions for the 

asymptotic power envelope and the asymptotic powers of previously pro- 



^ ■ posed tests are derived. These asymptotic powers are shown to lie very 

substantially below the envelope, at least for relatively small values of the 
number of symmetry-breaking directions under the alternative. In contrast, 
the asymptotic power of the likelihood ratio test based on the eigenvalues of 
the sample covariance matrix is shown to be close to that envelope. These 
results extend to the case of multispiked alternatives the findings of an ear- 
lier study (Onatski, Moreira and Hallin, 2011) of the single-spiked case. The 
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methods we are using here, however, are entirely new, as the Laplace ap- 
proximations considered in the single-spiked context do not extend to the 
multispiked case. 

Key words: sphericity tests, large dimensionality, asymptotic power, spiked 
covariance, contiguity, power envelope. 

1 Introduction 

In a recent paper, Onatski, Moreira and Hallin (2011) (hereafter OMH) analyze 
the asymptotic power of statistical tests in the detection of a signal in spherical 
real-valued Gaussian data as the dimensionality of the data and the number of 
observations diverge to infinity at the same rate. This paper generalizes OMH's 
alternative of a single symmetry- breaking direction {single-spiked alternative) to 
the alternative of multiple symmetry- breaking directions {multispiked alternative), 
which is more relevant for applied work. 

Contemporary tests of sphericity in a high-dimensional environment (see Ledoit 
and Wolf (2002), Srivastava (2005), Schott (2006), Bai et al. (2009), Chen et al. 
(2010), and Cai and Ma (2012)) consider general alternatives to the null of spheric- 
ity. Our interest in alternatives with only a few contaminating signals stems from 
the fact that in many applications, such as speech recognition, macroeconomics, 
finance, wireless communication, genetics, physics of mixture, and statistical learn- 
ing, a few latent variables typically explain a large portion of the variation in high- 
dimensional data (see Baik and Silverstein (2006) for references). As a possible 
explanation of this fact, Johnstone (2001) introduces the spiked covariance model, 
where all eigenvalues of the population covariance matrix of high-dimensional data 
are equal except for a small fixed number of distinct "spike eigenvalues." The alter- 
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native to the null of sphericity considered in this paper coincides with Johnstone's 
model. 

The extension from the single-spiked alternatives of OMH to the multi-spiked 
alternatives considered here, however, is all but straightforward. The difficulty 
arises because the extension of the main technical tool in OMH (Lemma 2) , which 
analyzes high-dimensional spherical integrals, to integrals over high-dimensional 
real Stiefel manifolds obtained in Onatski (2012) is not easily amenable to the 
Laplace approximation method used in OMH. Therefore, in this paper, we de- 
velop a completely different technique, inspired from the large deviation analysis 
of spherical integrals by Guionnet and Maida (2005). 

Let us describe the setting and main results in more detail. Suppose that 
the data consist of n independent observations Xf, t = 1, of a p-dimensional 
Gaussian vector with mean zero and positive definite covariance matrix E. Let 
E = (Jp + VHV) , where Ip is the p-dimensional identity matrix, cr is a scalar, H 
an r xr diagonal matrix with elements hj > 0, j = 1, r along the diagonal, and V 
a (p X r)-dimensional parameter normalized so that V'V — Ir- We are interested 
in the asymptotic power of tests of the null hypothesis Hq : hi = ... = hr = 
against the alternative Hi : hj > for some j = 1, ...,r, based on the eigenvalues 
of the sample covariance matrix of the data when n,p — > oo so that p/n ^ c 
with < c < oo, an asymptotic regime which we abbreviate into n^ p -^^ oo. The 
matrix V is an unspecified nuisance parameter, the columns of which indicate the 
directions of the perturbations of sphericity. 

We consider the cases of specified and unspecified u^. For the sake of simplicity, 
in the rest of this introduction, we only discuss the case of specified cr^ = 1, 
although the case of unspecified cr^ is more reahstic. Denoting by \j the j-th largest 
sample covariance eigenvalue, let A — (Ai,...,Am), where m — min(n,p). We 
begin our analysis with a study of the asymptotic properties of the likelihood ratio 
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process {L {h; A) ; /i e [O, JiY} , where h — {hi, hr) , h e [0, a/c) and L {h; A) is 
defined as the ratio of the density of A under Hi to that under Hq, considered as 
a A-measurable random variable. Note that L {h; A) depends on n and p, while 
A is m = min {n, p} -dimensional. An exact formula for L {h; A) involves the 
integral /^(p) e*''*^^*^^*^') (dQ) over the orthogonal group 0{p), where the p x p 
matrix A has a deficient rank r. In the single-spiked case (r = 1), OMH link 
this integral to the confluent form of the Lauricella function, and use this link to 
establish a representation of the integral in the form of a contour integral (see Wang 
(2010) and Mo (2011) for independent different derivations of this contour integral 
representation for this particular r — 1 case). Then, the Laplace approximation to 
the contour integral is used to derive the asymptotic behavior of L {h; A). 

Onatski (2012) generalizes the contour integral representation to the multi- 
spiked case (r > 1). For complex- valued data, such a generalization allows him 
to extend OMH's results to the multi-spiked context. Unfortunately, for real- 
valued data, which we are concerned with in this paper, this generalization is not 
straightforwardly amenable to the Laplace approximation method. Therefore, in 
this paper, we consider a totally different approach. For the r = 1 case, Guionnet 
and Maida (2005) (hereafter GM) use large deviation methods to derive a second- 
order asymptotic expansion of J^^^^ gtr(AQBQ') ^^q-^ ^-^^ non-zero eigenvalues of A 
diverge to infinity (see their Theorem 3). We extend GM's second-order expansion 
to the r > 1 case, and use that extension to derive the asymptotics of L {h; A). 

More precisely, we show that, for any h such that < h < a/c, the sequence 
of log-likehhood processes {In L{h;\);he [0,hY} converges weakly to a Gaussian 
process {Cx{h); h e [O, hY} under the null hypothesis Hq as n,p -^c oo. The index 
A in the notation C\{h) is used to distinguish the limiting A- log- likelihood process in 
the case of specified = 1, from that of the /x-log-hkelihood process considered in 
the case of unspecified cr^, which we denote by C^{h) (see Section 2). The limiting 
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process has mean E[Cx{h)] = | Yl ln(l — hihj/c) and autocovariance function 

Gov (jOx {h) ,Cx(h)^ = — I Ylij=i^^ ~ hihj/cj . That convergence entails the 
weak convergence, in the Le Gam sense, of the /i-indexed statistical experiments £^ 
under which the eigenvalues Ai, are observed, i.e. the statistical experiments 
with log-hkelihood process {inL;^ {h);he [O, hy} (see van der Vaart (1998), page 
126). Although this limiting process is Gaussian, it is not a log-hkelihood process 
of the Gaussian shift type, so that the statistical experiments £™ under study 
are not locally asymptotically normal (LAN) ones. The weak convergence of 
imphes, however, via Le Gam's first lemma (see van der Vaart 1998, p. 88), that 
the joint distributions of the normalized sample covariance eigenvalues under the 
null and under alternatives associated with h e [0, y/c) are mutually contiguous. 

An asymptotic power envelope for A-based tests of Hq against Hi can be con- 
structed using the Neyman-Pearson lemma and Le Cam's third lemma. We show 
that, for tests of size a, the maximum achievable asymptotic power against a point 



alternative h — (hi, hr) equals 1 — $ (1 — a) — ^/W(h) , where $ is the 
standard normal distribution function and W{h) = — | Yllj^i In (1 — hihj/c). As 
we explain in the paper, this asymptotic power envelope is valid not only for the 
A-based tests, but also for all tests that are invariant under left orthogonal trans- 
formations of the data Xt, t = 1, n. 

Next, we consider previously proposed tests of sphericity and of the equality of 
the population covariance matrix to a given matrix . We focus on the tests studied 
in Ledoit and Wolf (2002), Bai et al (2009), and Gai and Ma (2012). We find that, 
in general, the asymptotic powers of those tests are substantially lower than the 
corresponding asymptotic power envelope value. In contrast, our computations for 
the case r — 2 show that the asymptotic powers of the A- and //-based hkelihood 
ratio tests are close to the power envelope. 
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The rest of the paper is organized as follows. Section 2 estabhshes the weak 
convergence of the log-likelihood ratio process to a Gaussian process. Section 3 
provides an analysis of the asymptotic powers of various sphericity tests, derives 
the asymptotic power envelope, and proves its validity for general invariant tests. 
Section 4 concludes. All proofs are given in the Appendix. 

2 Asymptotics of likelihood ratio processes 

Let X be a pxup matrix with independent Gaussian N (0, cr^ {Ip + VHV')) columns. 
Let Api > ... > Xpp be the ordered eigenvalues of ^XX' and write Xp = {Xpi, Xpm) , 
where m — mm{p,np}. Similarly, let /x^j = Apj/(Api-|-...-|-App) , i — l,...,m and 

As explained in the introduction, our goal is to study the asymptotic power, 
as np,p — >c of the eigenvalue-based tests of Hq : hi — ... — hr — against 
Hi : hj > for some i = 1, r, where hj are the diagonal elements of the diagonal 
matrix H. If is specified, the model is invariant with respect to left and right 
orthogonal transformations; sufficiency and invariance arguments (see Appendix 
5.4 for details) lead to considering tests based on Ap only. If is unspecified, 
the model is invariant with respect to left and right orthogonal transformations 
and multiplications by non-zero scalars; sufficiency and invariance arguments (see 
Appendix 5.4) lead to considering tests based on /x^ only. Note that the distribution 
of /Ip does not depend on cr^, whereas, if is specified, we can always normalize 
Xp dividing it by a^. Therefore, we henceforth assume without loss of generality 
that 0-2 = 1. 

Let us denote the joint density of Xpi,...,Xpm ai x = (xi, x^) e (R"*")"* 
as fxp {x; h), and that of //^i, at y = (yi, ym-i) e (M^)"""^ as /^p (y; h). 
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We have 

|p-np|-l 
2 

'i 



0{p) 



where 7 depends only on Up and p; 11 = diag ((1 + hi)~^ , (1 + h^Y^ , 1, l); 
X = diag (xi, Xm, 0, 0) is a {p y< p) diagonal matrix; O (p) is the set of all 
p X p orthogonal matrices; and (dQ) is the invariant measure on the orthogonal 
group O (p), normalized to make the total measure unity. Formula ([T]) is a special 
case of the density given in James (1964, p. 483) for Up > p, and follows from 
Theorems 2 and 6 in Uhlig (1994) for Hp < p. 

Let X = Xi + ... + Xm and let i/i = Xi/x. Note that the Jacobian of the coordinate 
change from (xi, ...,Xm) to {yi, ...,ym~i,x) is x"^^^. Changing variables in ([1]) and 
integrating x out, we obtain 

|p-np|-l 

n,=i (1 + Jo ^ 

(2) 

where 3^ = diag {yi, ...,ym, 0, 0) is a (p x p) diagonal matrix. 

Consider the likelihood ratios Lp {h; Xp) = f\p (Ap; h) / fxp (A^; 0) and Lp (/i; /i) = 
ffip [f^p] h) / f^p (^fip] 0). Formulae ([1]) and ([2]) imply the following proposition. 

Proposition 1 Let O [p) be the set of all p x p orthogonal matrices. Denote by 
[dQ) the invariant measure on the orthogonal group O {p) normalized to make the 
total measure unity. Put Ap = diag (Api, Xpp) , 5*^ = Api + ... + Xpp, and let Dp 
be the p x p diagonal matrix diag ^2c~T+h7' 2c~T+h~' ^) ' '"^^^^^ — p/^p- 
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Then, 




In the special case where r = 1, the rank of the matrix Dp equals one, and the 
integrals over the orthogonal group in ([3]) and (jl]) can be rewritten as integrals over 
a j9- dimensional sphere. OMH show how such spherical integrals can be represented 
in the form of contour integrals, and apply Laplace approximation to these contour 
integrals to establish the asymptotic properties of Lp {h; Ap) and Lp [h; /i^) . In 
the r > 1 case, the integrals in ([3]) and (jl]) can be rewritten as integrals over a 
Stiefel manifold, the set of all orthonormal r-frames in MP. Onatski (2012) obtains 
a generalization of the contour integral representation from spherical integrals to 
integrals over Stiefel manifolds. Unfortunately, the Laplace approximation method 
does not straightforwardly extend to that generalization, and we therefore propose 
an alternative method of analysis. 

The second-order asymptotic behavior, as p goes to infinity, of integrals of the 
form /^(p) e^*''*^-^'^'^*^) (dQ) was analyzed in Guionnet and Maida (2005) (Theo- 
rem 3) for the particular case where D is a fixed matrix of rank one, A a deter- 
ministic matrix, and under the condition that the empirical distribution of A's 
eigenvalues converges to a distribution function with bounded support. Below, we 
extend Guionnet and Maida's approach to cases where D = Dp has rank larger 
than one, and to the stochastic setting of this paper. We then use such an extension 
to derive the asymptotic properties of Lp {h; Ap) and Lp [h; jj,^ . 

Let be the empirical distribution of Api,...,App, and denote by F^^ the 
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Marchenko-Pastur distribution function, with density 



(5) 



where = (l — y/c^Y bp = (l + a/c^)^, and a mass of max (0, 1 — c~^) at 
zero. As is well known, the difference between and F^^ weakly converges to 
zero a.s. as p, — )-c oo. Moreover, Api ^ (1 + y/cf , and App ^ (1 — ^/cf' if 



c > 1, and App — > if c < 1. 



Consider the Hilbert transform of F^f^, = / {x - X)'^ AF^'^^ {\) . 

That transform is well defined for real x outside the support of F^^^, that is, on 
the set M\ supp {F^^) ■ Using (ED, we get 



H. 



MP 



X + Cr, — 1 — 



X) 



X Cp 1) 4cp 



(6) 



where the sign of the square root is chosen to be the sign of (x — Cp — 1). It is 
not hard to see that H^^^ (x) is strictly decreasing on ]R\ supp {^p ^^) . Thus, on 
Hp^^ (]R\supp (Fjf^-^)), we can define an inverse function K^^^, with values 



X 1 — CpX 



X 



Gfff^ (M\supp(Fj^^n)- 



(7) 



The so-called i?-transform R^^^ of Fp^^ takes the form 



For e > and rj > sufficiently small, consider the subset of 



eJ7 



+ £,0 U 0, 



for c > 1, 
for c < 1. 
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From dSD, (^\ -PP i^n) = 0) ^ (o, U 

when c, > 1, (- ^(;_^) .o) U (o. ^^en c, < 1, and (-oo,0) U 

(0, 1/2) when Cp = 1. Therefore, Qer, C i/^^^ (M\supp {F^'^)) with probabihty 
approaching one as np,p — j-c oo. 

Proposition 2 Let {©p} &e a sequence of random p x p diagonal matrices 
diag (6*^1, 9pr, 0, 0) , where 9pj 7^ 0, j = 1, r. Further, let Vpj = Rp^^ i'^^pj), 
where R^^^ (x) = 1/ (1 — Cpx) is the R-transform of the Marchenko-Pastur distri- 
bution Fp ^^ . Assume that, for some e > and rj > 0, 26pj G Qs,ri with probability 
approaching one as np,p — i-c 00. Then, 

0{p) 

r j 

X JJ ^1 - 4 {6pjVpj) (OpsVps) Cp (1 + o(l)) a.s., 

j=l s=l 

where o(l) is uniform over all sequences {©p} satisfying the assumption. 

This proposition extends Theorem 3 of Guionnet and Maida (2005) to cases 
when rank (0p) > 1, 9pj depends on p, and Ap is random. When r = 1, 9pi = 



9 > and Vpi = v are fixed, it is straightforward to verify that y 1 ~ ^Cp = 
Va9^/Vz, where Z = J (Kf ^ (29) - A)~^dF/^^ (A) . In Guionnet and Maida's 
(2005) Theorem 3, the expression V A9^ /\/Z should have been used instead of 
which is a typo. 

Setting r = 1 and 9p\ = 2^Y+h Proposition |2] and using formula ([3]) from 
Proposition [1] gives us an expression for Lp{h] Ap) which is an equivalent of formula 
(4.1) in Theorem 7 of OMH. Theorem [3] below uses Proposition [2] to generalize 
Theorem 7 of OMH to the multispiked case r > 1. 
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Let 9pj — hj/2cp (1 + hj) and 



[-l + 5,0)U(0,v^-<5] forol, 
Hs^S (8) 
[-Vc + 5,0)U(0,Vc-5] forc<l. 



The condition hj e Hs for some S > implies that 29pj e for some £ > 0, 
rj > and p sufficiently large. Below, we are only interested in non-negative values 
of hj, and assume that hj G {0,^/c — 6] under the alternative hypothesis. The 
corresponding 9pj, thus, is positive. 

With the above setting for 9pj, we have Vpj — 1 + hj and K^^ i'^^pj) — 
(cp + hj) (1 + hj) /hj = Zjo, say, as in Theorem 7 in OMH. Define 

Ap (zjo) = ^ In (^,0 - Xpi) -P fin (zjo - A) dFj^^ (A) . (9) 

Theorem 3 Suppose that the null hypothesis is true (h = 0). Let S be any fixed 
number such that < S < \fc, and let C [0, ^/c — be the space of real-valued 
continuous functions on [0, ^/c — 8^ equipped with the supremum norm. Then, as 
p,np -^c oo, 

Lp{h;Xp) = flexpl^-lApizjo) + lj^^^^ and (10) 

Lp{h;i,p)=Lp{h;Xp)expi^^^^h)j - ^ (l + o (1)), (11) 

almost surely, where the a (1) terms are uniform in h & [0, a/c — . Furthermore, 
lnLp{h; Xp) andlnLp(^h; /i^ , viewed as random elements ofC [0, — 5\' , converge 
weakly to C\ (h) and Cfj, (h) with Gaussian finite- dimensional distributions such 
thatE{Cx{h)) = -^Vsn {Cx{h)), E(£^(/i)) = Var (£^ (/i)) , and, for any 
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Coy (^C,{h),C, {h 



Coy {Cx{h:),Cx (h 



) 



) 



) 




(12) 



(13) 



Theorem [3] and Le Cam's first lemma (van der Vaart (1998), p. 88) imply that 
the joint distributions of Ai,...,Am (as well as those of fi^, under the 
null and under the alternative are mutually contiguous for any h G [0, y/cY . By 
applying Le Cam's third lemma (van der Vaart (1998), p. 90), we can study the 
"local" powers of tests detecting signals in noise. The requirement that hj be 
positive under alternatives corresponds to situations where the signals contained 
in the data are independent from the noise. If dependence between the signals 
and the noise is allowed, one might consider two-sided alternatives of the form 
Hi : hj ^ for some j. Values of hj between —1 and correspond to alternatives 
under which the noise variance is reduced along certain directions. In view of 
Proposition O it should not be difficult to generalize Theorem [3] to the case of 
fully {hj 7^ 0, all j's) or partially {hj ^ 0, some j's) two-sided alternatives. This 
problem will not be discussed here, and is left for future research. 

3 Asymptotic power analysis 

Denote by (3^^ {h) and (3^ {h) , respectively, the asymptotic powers of the asymp- 
totically most powerful A- and yU-based tests of size a of the null h = against a 
point alternative h = {hi, hr) 7^ with hj < y/c, j = 1, ...,r. As functions of h, 
(3^ and are called the asymptotic power envelopes. 
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Figure 1: The power envelopes I3x (h) (upper panel) and (3^ (h) (lower panel) for 
a = 0.05, as functions of h/y/c = {hi, /i2) / y/c. 

Proposition 4 Let $ denote the standard normal distribution function. Then, 



2 . Y c 



and 



1 



i,j=i ^ ^ ^ ^ 



(14) 
(15) 



Figure [T] shows the asymptotic power envelopes iK) and iK) as functions 
of h\j ^ and h2/^/c when h = (/ii,/i2) is two-dimensional. 

It is important to realize that the asymptotic power envelopes derived in Propo- 
sition m are valid not only for A- and /i-based tests but also for any test invariant 
under left orthogonal transformations of the observations (X i— )■ QX, where Q 
is a p X p orthogonal matrix), and for any test invariant under multiplication 
by any non-zero constant and left orthogonal transformations of the observations 
{X I— 7- aQX, where a G and Q is a p x p orthogonal matrix), respectively. 
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Let ||v4||^ = tr (y4'y4) and WAW^ = (^'^) denote the Frobenius norm and 
the spectral norm, respectively, of a matrix A. Let Ho be the null hypothesis 
hi = ... = hj. = 0, and let Hi be any of the following alternatives: Hi : hj > 
for some j = 1, ...,r, or Hi : T, cr'^Ip, or //i : |S : ||S — a'^Ip\\p > en,p} , or 
ifi : {S : ||S — cr^/p||2 > £n,p}, where en,p is a positive constant that may depend 
on n and p. 

Proposition 5 For specified a'^ , consider tests of Hq against Hi that are invariant 
with respect to the left orthogonal transformations of the data X = [Xi, Xn] • 
For any such test, there exists a test based on A with the same power function. 
Similarly, for unspecified o"^, consider tests that, in addition, are invariant with 
respect to multiplication of the data X by non-zero constants. For any such test, 
there exists a test based on n with the same power function. 

Examples of the former tests include the tests of Hq : = I studied in Chen 
et al (2010) and Cai and Ma (2012). An example of the latter test is the test of 
sphericity studied in Chen et al (2010). The tests studied in Chen et al (2010) and 
Cai and Ma (2012) are invariant, although they are not A- or /i-based. 

For r = 1, OMH show that the asymptotic power envelopes are closely ap- 
proached by the asymptotic powers of the A- and /i-based likelihood ratio tests. 
Our goal here is to explore the asymptotic power of those likelihood ratio tests 
for r > 1. Unfortunately, as r grows, it becomes increasingly difficult to compute 
the asymptotic critical values for the likelihood ratio tests by simulation. For ex- 
ample, r = 2 requires simulating a 2-dimensional Gaussian random field with the 
covariance function and the mean function described in Theorem [3l 

For r = 2, Figure [2] shows sections of the power envelope (dotted lines) and the 
power of the likelihood ratio test based on A for various fixed values of hi/ ^/c under 
the alternative. Figure [3] shows the same plots for the tests based on fi. To enhance 
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Figure 2: Profiles of tlie asymptotic power of the A-based LR test (solid lines) 
relative to the asymptotic power envelope (dotted lines) for different values of 
hijy/c under the alternative; a = 0.05. 

readability, we use a different parametrization: 9j = ^— In (l — /i|/c), i = 1, ...,r. 
As hj varies in the region of contiguity [0, ^/c) , 6j spans the entire half-line [0, oo) . 
Note that the asymptotic mean and autocovariance functions of the log likelihood 
ratios derived in Theorem [3] depend on hj only through hj/y^ = Vl — e^^^ ■ 
Therefore, under the new parametrization, they depend only on = (^i,...,^^)- 
The parameter 6 plays the classical role of a "local parameter" in our setting. 

Figure H] further explores the relationship between the asymptotic powers of 
the A- and /i-based LR test and the corresponding asymptotic power envelopes 
when r = 2. We pick all values of h = {hi, h2) satisfying inequality hi > h2 and 
such that the asymptotic power envelope for A-based tests is exactly 25, 50, 75, 
and 90%. Then, we compute and plot the corresponding power of the A-based 
LR test (solid lines) against h2/hi. The dashed lines show similar graphs for the 
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Figure 3: Profiles of tlie asymptotic power of the /x-based LR test (solid lines) 
relative to the asymptotic power envelope (dotted lines) for different values of 
hi/y/c under the alternative; a = 0.05. 

/x-based LR test. The value h2/hi = corresponds to single-spiked alternatives 
hi > 0, h2 = 0, the value h2/hi = 1 corresponds to equi-spiked alternatives 
hi = h2 > 0. The intermediate values of /i2/^i link the two extreme cases. We 
do not consider values /i2/^i > 1, as the power function is symmetric about the 
45-degree line in the {hi, h2) space. 

Somewhat surprisingly, the power of the LR test along the set of alterna- 
tives (/ii,/i2) corresponding to the same values of the asymptotic power envelope 
is not a monotone function of h2/hi. Equi-spiked alternatives typically seem to be 
particularly difficult to detect by the LR tests. However, for the set of alternatives 
corresponding to an asymptotic power envelope value of 90%, the single-spiked 
alternatives are even harder to detect. 

A natural question is: how does the asymptotic power of the A- and /i-based 
LR tests depend on the choice of r, that is, how do those tests perform when 
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Envelope power 25% 



Envelope power 50% 




Figure 4: Power of A-based (solid lines) and /x-based (dashed lines) LR tests plot- 
ted against /i2/^i, where (/ii,/i2) are such that the respective asymptotic power 
envelopes (3x{h) and (3 {h) equal 25, 50, 75 and 90%. 



the actual r does not coincide with the value the test statistic is based on? For 
example, to detect a single signal, one can, in principle, use LR tests of the null 
hypothesis against alternatives with r = l,r = 2, etc. How does the asymptotic 
powers of such tests compare? Figure O reports the asymptotic powers of the A- 
and yU-based LR tests designed to detect alternatives with r = 1 (solid line) and 
r = 2 (dashed line), under single-spiked alternatives. As in Figures 2 and 3, we 
use the parametrization 9 = \/ — In (1 — /i^/c) for the single-spiked alternative. It 
appears that the two asymptotic powers are very close to each other; interestingly, 
neither of them dominates the other. Using LR tests designed against alternatives 
with r > 1 seems to be beneficial for detecting a single-spiked alternative with 
relatively small Q (and K). 

In the remaining part of this section, we consider examples of some of the tests 
that have been proposed previously in the literature, and, in Proposition [6l derive 
their asymptotic power functions. 
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)--based LR test 



H-based LR test 





Figure 5: Asymptotic power of the A-based (left panel) and //-based (right panel) 
LR tests. Solid line: power when r = 1 is correctly assumed. Dashed line: power 
when r = 2 is incorrectly assumed. 

Example 1 (John's (1971) test of sphericity Hq : Tj = a'^I.) John (1971) pro- 
poses testing the sphericity hypothesis 6 = against general alternatives based on 
the test statistic 



U = -tT 

P 



;i/p)tr S 



(16) 



where S is the sample covariance matrix. He shows that, when n > p, such a test 
is locally most powerful invariant. Ledoit and Wolf (2002) study John's test when 
n,p — j-c oo. They prove that, under the null, nil — p A (1,4) . Hence, the test 
with asymptotic size a rejects the null of sphericity whenever ^ {nil — p — 1) > 
$-1(1 -a). 

Example 2 (The Ledoit- Wolf (2002) test of Hq : i: = I.) Ledoit and Wolf (2002) 
propose 



W =-ii 




_ P_ 


-trt 


p 




n 


.P 



+ 



p 



n 



(17) 



as a test statistic for testing the hypothesis that the population covariance ma- 
trix is the unit matrix. They show that, under the null, nW — p -4 A^(l,4). 
As in the previous example, the null is rejected at asymptotic size a whenever 
l{nW-p-l) >$-i(l-a). 
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Example 3 (The Bai et al. (2009) "corrected" LRT of i/o : S = J.) When 
n > p, Bai et al. (2009) propose to use a corrected version 

CLR = trS -IndetS -P-J9 (^1 - (^l - In (l - 

of the likelihood ratio statistic to test the equality of the population covariance ma- 
trix to the identity matrix against general alternatives. Under the null, CLR -4 
(— I In (1 — c) , — 2 In (1 — c) — 2c) (still, as n,p '^)- The null hypothesis is 
rejected at asymptotic level a whenever CLR + |ln(l — c) is larger than 
(-2 In (1 - c) - 2c)^/^ (1 - a) . 

Example 4 (Tracy- Widom- type tests of Hq : = L) Letip (Ai, A^) he any 
function of the r largest eigenvalues increasing in all its arguments. The asymp- 
totic distribution of {Xi, Xj.) under the null is determined by the functional 
form of (f{-) and the fact that 

{<yn,c (Al - Vc) , (A, - V,)) 4 TW (r) , (18) 

where TW{r) denotes the r -dimensional Tracy- Widom law of the first kind, an,c = 
^2/3^1/6 _|_ 4/3 z/c = (1 + a/c)^. Call Tracy-Widom-typc tests all tests 
that reject the null whenever (p (Ai, A^) is larger than the corresponding asymp- 
totic critical value obtained from l[TS\) . 

Example 5 (The Cai-Ma (2012) minimax test of Hq : E = /.) Cai and Ma 

(2012) propose to use a U-statistic 

nin — l) ^ 

l<«<i<n 

where = {X[X2)^ — {X[Xi -\- X2X2) + p, to test the hypothesis that the 

population covariance matrix is the unit matrix. Under the null, as n,p — j-c 00, 
Tn N (0, 4c^) . The null hypothesis is rejected at asymptotic level a whenever 
Tn is larger than 2a/p(p+ 1) /n{n — 1)$^^ (1 — a). Cai and Ma (2012) show that 
this test is rate-optimal against general alternatives from a minimax point of view. 
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Consider the tests described in Examples 1, 2, 3, 4 and 5, and denote by /3j (h) , 
/^Lw W ) I^CLR W 5 f^cM W ^iid f3rp^, (h) their respective asymptotic powers at 
asymptotic level a. 

Proposition 6 The asymptotic power functions of the tests described in Exam- 
ples 1-5 are 

P^wih) = a, (19) 
Pj{h) = (5L^{h) = (5cM{h) = l-<^(^-\l-a)-]^Yr^^ (20) 

for any h = {hi, hr) ^ such that hj G [0, y/c) for j = 1, ...,r. 

Formula (!20|) for (h) directly follows from Proposition 2 of Cai and Ma 
(2012). The proof of the other formulae follows along the same lines as in the 
proof of Proposition 10 in OMH, and is omitted. Except for the Tracy- Widom 
tests of Example 4, all those asymptotic power functions are non-trivial. Figures 
E] and [7] compare these power functions to the corresponding power envelopes for 
r = 2. Since John's test is invariant with respect to orthogonal transformations 
and scalings of the data, Figure [6] compares (3j (h) (solid line) to the power envelope 
P^{h) (dotted line). The Ledoit-Wolf test, the "corrected" likelihood ratio test, 
and the Cai-Ma test are invariant with respect to orthogonal transformations of the 
data only, and Figure [7] thus compares the asymptotic power functions (3^^^ {h) = 
f^CM W f^cLR W (solid and dashed lines, respectively) to the power envelope 
Pxifi) (dotted line). Note that (^clrW depends on c. As c converges to one, 
PcLR (^) converges to a, which corresponds to the case of trivial power. As c 
converges to zero, PclrW converges to Piwi^) — I^cmW- Figure [71 we 
provide plots of /S^j^/j (h) that correspond to c = 0.5. 
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log{1-h^/c))"' 



Figure 6: Profiles of the asymptotic power of John's test (sohd fines) relative to tfie 
asymptotic power envelope (3^ (dotted lines) for different values of hi/y/c under 
tfie alternative; a = 0.05. 



Tfiese comparisons sfiow tfiat, contrary to our LR tests (see Figures [2] and [3]), all 
tfiose tests eitfier fiave trivial power a (tfie Tracy- Widom ones), or power functions 
tfiat increase very slowly witfi hi and /i2, and lie very far below tfie corresponding 
power envelope. 



4 Conclusion 

Tfiis paper extends Onatski, Moreira and Hallin's (2011) (OMH) study of tfie 
power of fiigfi-dimensional spfiericity tests to tfie case of multi-spiked alternatives. 
We derive tfie asymptotic distribution of tfie log-likelifiood ratio process and use it 
to obtain simple analytical expressions for tfie maximal asymptotic power envelope 
and for tfie asymptotic powers of several tests proposed in tfie literature. Tfiese 
asymptotic powers turn out to be very substantially below tfie envelope. We 
propose tfie likelifiood ratio test based on tfie data reduced to tfie eigenvalues of 
tfie sample covariance matrix. Our computations sfiow tfiat tfie asymptotic power 
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0.5 



0.5 



-log(1-h^/c))''" 



-log{1-h^/c))"' 




log{1-h^/c))"' 



Figure 7: Profiles of tlie asymptotic power of tfie Ledoit-Wolf and Cai-Ma tests 
(solid lines) and the CLR test (dashed lines, for c = 0.5) relative to the asymptotic 
power envelope (dotted lines) for different values of h\l ^fc under the alternative; 
a = 0.05. 

of this test is close to the envelope. 



5 Appendix 

All convergence statements made below refer to the situation when np,p — t-c oo. 
We start with two auxiliary results. 



Lemma 7 Let d (/x, z/) be the Dudley distance between measures n and v defined 
over (R,-B).- 



d {jj,, u) = sup 



/ {djj, — du) 



• / {"x) < 1 and 



<l,Vx^y . 



There exists a constant r > such that d[ F^, F^^^ 



x-y 
= {p~^ log'^p) a.s. 



Proof: Let us denote the cumulative distribution function corresponding to a 
measure /i as F^ (x) . Further, let us denote inf {\x2 — xi\ : supp (/i) C [a;i,a;2]} as 
diam (yu) . Consider the following three distances between measures fi and u : the 
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Kolmogorov distance A; (/i, z/) = sup^, (x) — Fj, (x)| , the Wasserstein distance 
w{^,,i') = sup||/ f {dfi — dh')\ : '^^^IZy'^^^ ^ 1)^3; 7^ ?/|, and the Kantorovich 
distance 7 (yU, v) = j (x) — Fy (x)| dx. As is well known (see, for example, ex- 
ercise 1 on p. 425 of Dudley (2002)), w (/i, z^) = 7 (/i, z^) . Therefore, we have 

d {f^\ F^) < w = 7 {f^\ < k ^) (diam(F,^) + diam(F/^0 

As follows from Theorem 1.1 of Gotze and Tikhomirov (2011), there exists a 
constant r > such that Yl'^=i^^ (^F^, F^ ^^^ >ep~^log^p^ < 00 for all 
e > 0. Thus, k(F^,F^'P^ = o(p~Mog» a.s.. Since diam (F/"^^) is 0(1) and 
diam (^F^^ — diam [F^^^) -> a.s., the result follows. □ 

Corollary 8 Suppose that a sequence of functions {/p(A)} is hounded Lipshitz 
on supp (^Fp^^^ U supp [Fp], uniformly over all sufficiently large p, a.s.. Then 



I /p(A)d [Pp' (A) - F/^^ (A)) = o (p-V2) , a.s 



5.1 Proof of Proposition [2] 

Let us denote the integral / 6^^''^®''^'^^^^ (dQ) as Ip{Qp,Ap). As explained in 

0{p) 

Guionnet and Maida (2005, p. 454), we can write 

/, (e„ A,) = Ea^ exp |p ^"^^ guy'gU) I ' (22) 

where Ea^ denotes the expectation conditional on Ap, and the p-dimensional vectors 
[g''^\ ^('"^) are obtained from standard Gaussian p-dimensional vectors {g^^\ g^^'^) . 
independent from Ap, by a Schmidt orthogonalization procedure. More precisely, 
we have g^^^ = X]fc=i ^jkd^'^K where Ajj = 1 and 

E Ajk9^'^'9^'^ = -9^'^' 9^'^ for t = 1, J - 1. (23) 

fc=i 

In the spirit of the proof of Guionnet and Maida's (2005) Theorem 3, define 
l^f =^/pi^-9^'^'9'^'^-5,}j and ^'^f = {^-g^^^' A^g^^^ -Vp,5,}j , (24) 
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where Sjs = 1 {j = s} stands for the classical Kronecker symbol. As will be shown 
below, after an appropriate change of measure, 7^1*'' and 7^2*'' are asymptotically 
centered Gaussian. Expressing the exponent in ( l22l) as a function of 7pi*'' and 7^2"*'') 
changing the measure of integration, and using the asymptotic Gaussianity will 
establish the proposition. 

Let 7,= 7r'^l -.it"'), where 7?^^ = (7^'^^, 7^^^ 



Using this notation, ( 122|) . (123|) . and (12^ . we get, after some algebra 

/, (e„ A,) = I u, (7,) e^^-^ n n dP i 

j=i i=i 

where P is the standard Gaussian probability measure, and 



iJ) 



fp,e hp) 



.i=i 



with 



(25) 



(26) 



I] 'iG 



p2 



Wpj 



^pi 



-1 



Ipl ) 



-27 



pi \ ^pl "T 



-1 



^pjlpl \^pl ^ -' 



lp2 5 

Ipl 5 



V 



7p2 Ipl \ ^vl ^ ) Ivl 



pi 



Ipl 



-1 



Ipl Tpi 



and 



-1 



Ipl ) 



where Gp^ is a (j — 1) x (j — 1) matrix with {k, s)-th element p "^''^7pi''*'' 



Wpj = diag (wpi, Wpj.i) , and 7 
Next, define the event 



pi 



7 



pi 1 



pi 



I 



B 



M,M' 



7pi 



< M and 



7p2 



< M' for all j,s = 1, ...,r 



where M and M' are positive parameters to be specified later. Somewhat abusing 
notation, we will also refer to Bm,m' as a rectangular region in R''^'^^ that consists 



24 



of vectors with odd coordinates in {—M,M) and even coordinates in {—M',M'). 
Let 

j=i i=i 

where 1 {•} denotes the indicator function. Below, we estabhsh the asymptotic 
behavior of 7^'-^' (O^, Ap) as first p, and then M and M', diverge to infinity. We 
then show that the asymptotics of 1^^'^' (Op, Ap) and Jp (9p, Ap) coincide. 

Consider infinite arrays |Pp^\p = 1,2, = 1, , j — 1, ...,r, of random 

centered Gaussian measures 

(jpO.) ^ y 1 + 29pjVp^ - 2gpjApj ^_i^^2e„,»;,,-2e,,A.i)^^^j._ 

Since Vpj = R^^ C^^pj) = 1/ ~ '^(^pjCp) and 2^pj e O^^, there exists £ > such 

that, for sufficiently large p, 

1 2 
■^pi + > (-'- + V^) + ^ when 9pj > and 



Vpj + — — < —e when < 0. 



Recall that App > 0, and Api — )■ (1 + ^/c) a.s.. Therefore, still a.s., for sufficiently 
large p, Vpj + > Api when 9pj > and Vpj + < App when 9pj < 0. Hence, 
the measures Pp^^ are a.s. well defined for sufficiently large p. Whenever Pp^^ is not 
well defined, we re-define it arbitrarily. 
We have 

jM,M' (-0^^ yY^^ = ^PT,"i=i[OpjVpj-^ Ei=i ln(l+26»p3i;pj-26»pjApi)] jM,M' ^ (^27) 

J,"'*" = / 1 {Sm,m.} /p, (7p) n n <' (sF) • (28) 

i=l i=l 

We now show that, under YYj^i Y[^=i ^^pi (s'l^^) ' converges in distribution 

to a centered + r-dimensional Gaussian vector, so that J^'^' is asymptotically 
equivalent to an integral with respect to a Gaussian measure on M''^+''. 
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First, let us find the mean ]Ep7p, and the variance Vp7p of 7^ under mea- 
sure n;=inr=i<? {g?)- Note thatVp7p = diag(Vp7i^'^)Vp7?''l.,V,7^^)) and 
Ep7p= ^Ep7p^'^'', Ep7p^'"^ Ep7p'"''''' y. With probabihty one, for sufficiently large p, 
we have 

E,7r = v^^-(^E(r ' 



- 1 



ii'*"' (2»pO - A 

which, by Corollary 1, is o(l) uniformly in 26pk G i^^/;) a-S-- That Corollary 1 can 
be applied here follows from the form of expression ([71) for K^^^ (x). Similarly, 

^ V-P^^ jJg^^^,{Pi W-Fr (A)) (1) 
uniformly in 26pk,26ps G ^2^^, a.s.. Thus, 

sup Ep7p = o(l) a.s.. (29) 

{2epj<an^,j,j<r} 

Next, with probability one, for sufficiently large p we have 



ks 



p (1 + 29pkVpk — 29pk\pi) (1 + 26psVps — 26ps\pi) 
T - f dF^W .^.H - r Then 

using Corollary 1, we get 

^vTpi = j?r^KL = j^-^KL + 0(1) a.s., 

uniformly in 2^pfc, 2^ps G Qerj- Similarly, we have 



Xli (1 + 6ks] 



4:6pk0ps 

and 



p (1 + 29pkVpk — 26pkXpi) (1 + 26psVps — 26ps\pij 

^ {2ep^Ki'^ (2^p,) i/^l-2^p.ii'p^^ (2^p,)-2^p.ir,^^ (2^p,)) +c 
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Xpi (1 + 6ks 



P ~^ (1 + 'iOpkVpk — 29pk\pi) (1 + 29psVps — 26ps\ 
1 + hs 



4:6r,u6. 



pk"ps 



uniformly in 26pk,26ps G l^e,,, a.s.. 

A straightforward calculation, using formula (jTj), shows that 

K% = - ^pVsk^ , and V,7?'^) = V^^^'^) + o(l), 

(k s) 

uniformly in 29pk-,'^dps ^ ^er?? £i^-s., where the matrix ' has elements 



(30) 



y{Ks) ^ [l+dks)(}--^0pkVpk9psVskC^~^ , (31) 
^P,i2^ = '^^p,2f^ = (l + 4s)t'pfet'.fe (l-46'pfc^^pfe6'p,t;^fcCp)~\ and (32) 
^p,22^ = (l+f^fcj [cpfpfcf^fc+^^lfcfsfc (l-4V^pfc6'p^t^sfcCp)"^] . (33) 

This implies that 

r 

ps^skCp) J (34) 

k>s 

which is bounded away from zero and infinity for sufficiently large p, uniformly 
over {26'pj G ^eri,j < r}, a.s.. 

By construction, 7p is a sum of p independent random vectors having uniformly 

(i) fjj) 



bounded third and fourth absolute moments under measure 11^=1 Yl^=i dPpi y9i 
Therefore, a central limit theorem applies. Moreover, since the function /p ^ (7^) 
is Lipshitz over Bm,m', uniformly in {2^^pj G ^e-qij < , Theorem 13.3 of Bhat- 
tacharya and Rao (1976), which describes the accuracy of the Gaussian approx- 
imations to integrals of the form (p8l) in terms of the oscillation measures of the 
integrand, implies that 

Jp^'"' = I fp,e (x) d$ (x; Ep7p, Vp7p) + om,m' (1) , (35) 
where $ (a;; Ep7p, Vp7p) denotes the Gaussian distribution function with mean 



27 



Ep7p and variance Vp7p, and om,m' (1) converges to zero uniformly in {26pj G fig^, j < 
as p — )■ oo, a.s.. The rate of such a convergence may depend on the values of M 
and M'. 

Note that, in Bm,m', as p — )■ oo, the difference fp^e (7^) — /p g (7^) converges to 
zero uniformly over {26 pj G ^e-qij < , where 

Ipfihp) = exp|^^^p,(iVi,+ ... + iV4,)|, with (36) 



Ns, = -27l- 



and iV4, = t;p,7S'''^'"'^S'''^'"'^ 



12 ' 



Such a convergence, together with ( l29il . ( 130|) . and ( l35il implies that 



P 



f , (x) d<l> (a;; 0, Vp) + om,m' (1) 



M,M' 



where \/p = diag ( vi'''\ vi'''\ ^/J'^'^) 



(37) 



Note that the difference / ^ (a;) d$ (a;;0,V'p) - / Q (x) d$ (x;QPQ con- 



verges to zero as M, M' — > 00, uniformly in p for p sufficiently large. On the other 



hand, 



where 



r j ^exp 
Jp^, (x)d$ (a:;0,i;)=nn 
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29 pj (vpj -\- Vps) "^^pj 



V 



26pj 







Using 
and 



we verify that, for sufficiently large p, Wp'^'^^ is a.s. positive definite 



det {W^^'^^) = (1 + SjsY CpVpjVps, and 
det {V^^''^) = (1 + 5jsf CpVpjVps (1 -4 {OpjVpj) {9psVp,) Cp)' 



(39) 
(40) 



Therefore, 
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r J 



(a;) d<|. (a;; 0, V,)=l[ J] ^Jl -4 (e.^v,,) (9 

ps^ps) ^p 

j=l s=l 



and, uniformly in p for p sufficiently large, 

r J 

lim I I l^„,(x)d^(x;0,Vr,)-' 

M,, 



Vp^ Cp UO. (41) 



Equations ([27]), ([37]), and ([H]) describe the behavior of J^'*^' (6p, Ap) for large p, 
M, and M'. 

Let us now turn to the analysis of Ip (6p, Ap) — I^'^' (6p, Ap) . Let Bm be the 



event 



I 7j,{''^ < M for all j, s < r|, and let 

/^^ (Gp, Ap) = Ea J 1 {5,,} exp <j p 5^ 9, 



As explained in Guionnet and Maida (2005, p. 455), 7pi'*\ j,s = l,...,r are inde- 
pendent of g^^^' Ag^^y g^^^' g^^\ j = 1, ...,r. Therefore, 

Jp*^ (Bp, Ap) = Ea^ (1 {Bm}) Ip (Op, Ap) = (l - Ea, (1 {BIA)) I, (Gp, Ap) . 

Denoting again by P the centered standard Gaussian measure on M, we have 



Ea„ 1 



> M 



r p 



>M]\[\[dW {^^ 



j=i i=i 



For j ^ s and r G {-l^/p, l^/p) 



r p 



e^^pi JJ JJ dP ( 
j=i i=i 



1=1 



p 



< e 



2t^ 



Therefore, using Chebyshev's inequality, for j ^ s and r G (— ^a/p, ^y/p) 



r p 



j=i i=i 



2t^ 



Mt ■ 



Setting T = M/4 (here we assume that M < 2y/p), we get 



1 < > M 



r p 



j=l i=l 



dP ( gl^ '^ ) < e 
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Similarly, we show that the same inequality holds when 7p{'*'' is replaced by —'^p'^\ 



and thus 



j=i i=i 



(42) 



For j = s, the same line of arguments yields 

r p 



1 1 {|7?'-''^| > M} fJJJdP ((??^) < 2e-^'/i^ (43) 
j=i i=i 

Inequalities ^ and (gS]) imply that Ea^ (1 {3%^}) < 2r'^e~^^^^^^ , and there- 
fore, for sufficiently large p, 

Ip {Op, Ap) > (Op, Ap) > (l - 2r2e-*^'/i6 j _ 

Note that 



(44) 



(45) 



where 



J, 



MM'oo 



r p 



1 {Bm\Bm,m'} fp,e ilp) n n 

j=l i=l 



We will now derive an upper bound for .7^^'^^''°°. 

From the definition of fp^g (7p) , we see that there exist positive constants (3^ 
and (32, which may depend on r, e and r], such that for any 9pj satisfying 29pj G Qs-q, 
j < r and for sufficiently large p, when Bm holds. 



fp,e {7p)<exp\(3,MY, hp2 



ik,s) 



,k=l 



Let Sgj,, = Bm n ||7f/^| = max,-^<, |7^i'™)| > M'}. Clearly, Bm\Bm,m' 



UL=i^m'^- Therefore, 



MM',00 



< 



E / 



r p 



/3iMr2|7^'2='|+/32M2 



j=l i=l 

r p 



j=l i=l 



First assume k ^ s. Denote Xpi{l — 29pkXpi + 26pkVpk) (1 — 26ps\pi + 2^, 



ps'^ps) 



as Api and (1 — 26pjXpi + 29pjVpjf^'^ g^^^ as g^-^K Note that, under P^^-*, is a stan- 
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dard normal random variable. Further, as long as 29pj e Jl^^ for j < r, Xpi consid- 
ered as a function of Aj is continuous on Aj e suppF^ for sufficiently large p, a.s.. 
Hence, the empirical distribution of A^ converges. Moreover, Amax = niaxi=i_..._p(Api) 
and Amin = niini=i^...^p(Api) a.s. converge to finite real numbers. Now, for r such 
that |t| < y^/(2Amax), wc havc 



/ 



0=1 i=l 

1=1 i=l V ^ 



for sufficiently large p, a.s.. Using this inequality, we get, for sufficiently large p 
and any positive t such that /3ir^M + t < ^Jpj (2Amax), 

7p2' j=l t=l j=l i=l 



Setting t = -/^ir^M (here we assume that M and M' are such that t satisfies 
the above requirements), we get 



p2 ^™ j=\ i=\ 

(k s) (k s) 

Replacing 7^2' by —7^2' iii the above derivations and combining the result with 



the above inequality, we get 



When k = s, following a similar line of arguments, we obtain 
and thus, for sufficiently large p, 
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Finally, combining (jS]), (H5|) . and ( H6|) . we obtain for 
the following upper and lower bounds: 



(47) 



J. 



M,M' 



< Jp< { 1 -2rVT^ 



(4J 



Let r > be an arbitrarily small number. Equations ( 1371) and fHTl) imply that 
there exist M and M' such that, for any M > M and M' > M' , 



r 3 



for all sufficiently large p. Let us choose M > M and M' > M' so that 



r 

<4 



1 - 2r^e-^ < 2 



1 - 2r^e ) 4r^e 



2 N -1 



(M'f 



< 



and 



2N -1 



1 - 2r^e-T6" 



1 



r J 



for all sufficiently large p, a.s.. Then, ( 148|) implies that 

r j 



Pj^Pj) vps^ps) ^p 



< T 



(49) 



j=l s=l 

for all sufficiently large p, a.s.. Since r can be chosen arbitrarily, we have, from 



dSD and (Uni), 



X I Y 1-4(9 

pj'^pj) ^ps^ps) ~l~ o(l) 

yj=l s=l 



where o(l) — )■ as p — )■ oo uniformly in {29pj G ^e,;, j < r} , a.s..n 
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5.2 Proof of Theorem [3] 

Setting = 2^^^, we have v^j = l + hj, OpjVpj = ^, and 

In {l + 29pjVpj-29pj\pi) = In + ^^^J ~ -^p*) ' 

Further, by Lemma 11 and formula (3.3) of OMH, / In {zjo - A) dF^'^ (A) = ^ - 
— In (1 + /lo) + In for sufficiently large p, a.s.. With these auxiliary results, 

formula ( fTOl) is a straightforward consequence of ([3]) and Proposition 2. 
Turning to the proof of ffTTl) . consider the integrals 

In what follows, we omit the subscript p in to simplify notation. Note that X (0, oo) 
is the integral appearing in expression (jl]) for Lp(/i;/ip). Let us now prove that, 
for some constant a > 0, 

X{0,oo)=X{p-ay/p,p+ay/p){l + o{l)), a.s. (50) 

where o(l) is uniform in h E [0, y/c — 6^ ■ 

Since, by Corollary 1, Sp/p — )■ 1 a.s., the set Hs is bounded from below, and 
Api — 7- (1 + y/c)'^ a.s., there exists a constant Ai > that depends only on 6 
and r, such that inf |-g ^_^j'-pxtr (DpQ'ApQ) / Sp > —Aix/2 for all a; > and all 
sufficiently large p, a.s.. Therefore, for all h G [0, y/c — Sf , 



oo 







and, using Stirling's approximation, we get 

1(0.00) > (^) {^) e--(-j (1 + 0(1)) 

= p 2 e ^ / — (l + o (1)) , a.s. (51) 

\np J 

Next, there exists a constant ^2 > such that, for all x > and all sufficiently 
large p, snp ^^^^^ ^^^y px ti (DpQ'ApQ) / Sp < A2X/2, a.s.. Therefore, a.s., for all 
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sufficiently large p, 



np 



where T y) is the complementary incomplete Gamma function (see Olver 1997, 
p. 45) with y = {p + a^/p) ( "~2^^ ) • Hence, for sufficiently large p, y > np/2 + 
nay/p/4:, and we can continue 



np 

X(p+a^,oo)< — - — r — , — + — ; — 1 , a.s. 



2 ) V 2 ' 2 4 

Now, r (/3,7) < e-^7''/(7 - /3 + 1) whenever /3>land7>/3-l (Olver 1997, 

p. 70). Therefore, we have, for sufficiently large p, 

np 

an^/A + 1 



X {p+a^Jp, oo) < 1 



2 3 4 
np a n i a n _ a n_ 

np A2P,A|pe 2 16 ^-48Vp 128p 

P 2 e 2 + 4n __ (1 + oil)) 



np np (3 



2 

„ I A an 
P ^2 



32p 

< p^e-^ ^ ' (l + o(l)), a.s. 

any/p/A + 1 



Comparing this to flSTj) . we see that a can be chosen so that 

X(p+ay^, oo) = o(l)X(0, oo) , a.s. (52) 
Further, for sufficiently large p, 

X(0,p-a^) < / X 2 2 ^dx 



' ' t 2 dt, a.s., 



where y = {p — d^/p) """^^ < ^ ~ ""2^ . Therefore, for any positive z < ^ and 
sufficiently large p, 

np np an^ 

X(0,p-«v^) < f^-^j / t--^e-*dt 







71 p 
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Setting z = an^/A and using Stirling's approximation, we have, a.s.. 



np 1 



2 4yV2/V2 4 

so that 

np np 1 

X(0,p-av^) < [-^) [y 4 j e--+^v^(l + o(l)) 

< 2eV / (1+0 (1)) , a.s.. 



Comparing this to ( ISTl) . we see that a can be chosen so that 

X(0,p-av^) = o(l)X(0,oo), (53) 

a.s.. Combining ( 152|) and ( 153|) . we get ( l50l) . 

Now, letting = ;f-6'pj = ;f-2^ yT/T' '^^^^ ^^^^ there exist e > and > such 
that |2^pj : hj G [0, a/c — 5] and x G [p — Oy^, p+a^^] | C Q^^ for all sufficiently 
large p, a.s.. Hence, by ( |50l) . and Proposition 2, a.s.. 



J(0,oo) = / a;^-ie-t^e^^^'=i[''"*""^^^-i'''(^+^''p^'*""^''"^''*)] (54) 

Jv-a,/v 



r J 



X 



n n V (^pi^pj) (^ps^ps) + o(i) I dx. 



where o(l) is uniform in /i G [0, y/c — 5Y and x G [p— Oy^, P+Oa/p] • 

Expanding OpjVpj - j^Yl^=i^ri (^l + 29pjVpj-29pj\p^^ and {OpjVp}j {OpsVp^ into 
power series of ^ — 1, we get 

|. p(Bo+Bi(f-l)+B2(f-l)') 



I(0,oo) = / 



X 2 2-6' 

p-^Vp 



X 



n n V ^ ~^ ('^pi^pi) ('^ps^ps) Cp + o(l) j dx. 



where i?o, -Bi and are 0(1) uniformly in /i G [0, -\/c — 5]^ . Further, consider the 



integral /"^^^liE^i .n^pfsi^B^f^-i 



\2 



'p-«v^ 
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Splitting the domain of integration into segments [p — Q;^/p,p— ap'''] , \p—ap'^,p+ap^ 
and [p+ap'^ ,p+a^yp\ , where < 7 < 1/2, and denoting the corresponding inte- 
grals by and respectively, we have 

7^^^ > / x^~'^e~^''e^^''dx > p^ ll J / y^'^e^^^dy, and 



Using the Laplace approximation, we have 

Tip -1 np / \ I np -1 np 

y^~ e~^^ay = o(l) / y^~ e~^^ay, and 







2^ 

/ 7/ 2 2 = o(l) / y 2 2 ^dy. 



SO that 7^^^ dominates I^^^ and /^^^ and 

r"pfap' 



= (1 + 0(1))/ x^-ie-M^^i+^^(i-^) )dx 

/•p^p'' 

= (1 + 0(1)) / x^^-^e-t^e^^^dx 

>/p-apT 

= (1 + 0(1)) / x'^-V^^e^iMx. 



This imphes that 



X [ J]^ JJ ^Jl-A{0pjVpj) (§psVp^Cp + 0(1) j dx. 



and hence, only constant and linear terms in the expansion of 
OpjVpj — ^ YTi=i 1^ ^l+S^pj'Cpj — 2^pjApi^ into power series of | — 1 matter for the 
evaluation of X(0, 00) . Let us find these terms. 
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By Corollary 1, ^ — ~ p ~ ^ ~^ ^) Using this fact, after some 

algebra, we get 



9 v =9 -v +9 -v"^- 
jpj upj upj -r yp-) Upj 



and 

±\n{K-^ [29p)-Xp) = X:in(i^r(2^..)-V)-P(l-4C,^>J.)(^-|) 
i=i 1=1 ^ ^ 



X 


Sp 


p 


P 




Sp 




P . 



X 

p 

2 

X 
P 



+01 (--l' 

.P 



It follows that 



X(0,oo) = / a;T-ie-t^e^^^'-i[^^^'^"~^^^=i^°^^+^^"''""^''"^'''^] (55) 

Jv-a^/p 



/ r j 



j=l s=l 



= {l + o{l))\{{l + h^)"-^ Lp{h-\) x^-i 

where the last equality in f l55|) follows from (|3]) and Proposition 2. 
The last equality in (1551) . (jlj) and the fact that 

r(^) (1+0(1)) 

imply that 



np 

hi 



Lp{h-f,p) = {l + o{l))Lp{h-Xp)e^^-^-^'^ — 

\ j=i ^^P 



which establishes ( jTT|) . The rest of the statements of Theorem 1 follow from I^U 
(fTTj). and Lemmas 12 and A2 of OMH.D 
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5.3 Proof of Proposition [4] 

To save space, we only derive the asymptotic power envelope for the relatively more 

difficult case of real-valued data and /i-based tests. According to the Neyman- 

Pearson lemma, the most powerful test of /i = against the simple alternative 

h = {hi,...,hr) is the test which rejects the null when Lp{h] is larger than a 

critical value C. It follows from Theorem 1 that, for such a test to have asymptotic 

size a, C must be , 

C = (1 - a) + m {h) , (56) 



where 



1,7 = 1 ^ ^ 



hihj\ 


^ hihj^ 


c J 




hihj ^ 

C I 


\ hjh^ 

1+ / 



W{h) 



Now, according to Le Cam's third lemma and Theorem 1, under h = {hi, hr) , 
In Lp{h; fij) A N{m{h) + W (h) , W (h)) . The asymptotic power ([15]) follows.D 

5.4 Invariance issues and Proof of Proposition [5] 

Before turning to the proof of Proposition [5l let us clarify the invariance issues 
in the problem under study. For basic definitions (invariant, maximal invariant, 
etc.), we refer to Chapter 6 of Lehmann and Romano (2005). 

Suppose that X is a p x n random matrix with vec (X) ~ N (0, /„ ® S). This 
model is clearly invariant under the group Qp, acting on M*'^", of left-hand multi- 
plications by a p X p orthogonal matrix x t— )■ Qx, x G M^^", Q ^ O (p) ; so are the 
null hypothesis Hq and the alternative Hi. Letting m = min(n,p) , the m-tuple 
A (X) = (Ai,...,Am) of non-zero eigenvalues of ^XX' is clearly invariant under 
that group, since ^xx' and ^ (Qx) (Qx)' = ^Qxx'Q' share the same eigenvalues 
A (x) for any orthogonal matrix Q and any matrix x G M^^". However, A (X) is 
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not maximal invariant for G„, and {xP) {xPy = xPP' ' where P is 

an arbitrary n x n orthogonal matrix, share the same A (x) = A (xP) although, in 
general, there is no p x p orthogonal matrix Q such that xP = Qx. 
Now, the joint density of the elements of X is 

/^") (x) = (27r)-"P/' |Sr"/'exp |-^tr {^-'xx')^ x E M^^". 

By the factorization theorem, XX' is a sufficient statistic, and it is legitimate to 
restrict attention to XX'-measurable inference procedures. Left-hand orthogonal 
multiplications Qx of x yields, for xx' , a transformation of the form Qxx'Q' . When 
Q range over the family Op of p x p orthogonal matrices, those transformations 
also form a group, Qp, say, now acting on the space of p x p symmetric positive 
semidefinite real matrices of rank m. Clearly, A (x) is maximal invariant for Qp, as 
xx' and yy' share the same eigenvalues if and only if yy' = Qxx'Q' for some p x p 
orthogonal matrix Q. 

Combining the principles of sufficiency and invariance thus leads to considering 
A-measurable tests only. 

A similar reasoning applies in the case of unspecified o"^, with a larger group 
combining multiplication by an arbitrary non-zero constant with the p x p left 
orthogonal transformations. Sufficiency and invariance then lead to restricting 
attention to /^-measurable tests. 

Proof of Proposition [5l 

With the same notation as above, write T = T (X) = XX' for the sufficient 
statistic. Consider an arbitrary invariant (under the group Qp of left orthogonal 
transformations of M^^") test (X), and define (t) = E (0 (X) \T = t). Then 
ip (T) is a T-measurable test with the same size and power function as (X) . It 
follows from the proof of Theorem 6.5.3 (i) in Lehmann and Romano (2005) that 
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ip (T) is almost invariant. Moreover, since the conditions of Lemma 6.5.1 (same 
reference) hold, this test is invariant under the group Qp (acting on T). Since the 
ordered m-tuple Ai, \m of the eigenvalues of = ^XX' is maximal invariant 
for Qp, and since any invariant statistic is a measurable function of a maximal 
invariant one, ip (T) must be A-measurable. Hence, ip (T) is a A-measurable test 
and has the same power function as (p {X) , as was to be shown. 

The existence of a //-measurable test with the same power function as that of 
a test (p {X) invariant under left orthogonal transformations and multiplication by 
non-zero constants is established similarly. □ 
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